Retrieving BioMedical Information: Challenges and Possibilities

نویسنده

  • Heri Ramampiaro
چکیده

A large amount of biomedical information is available to researchers today, and it is continuously increasing. As a result, researchers widely agree that the ability to precisely retrieve desired information is vital to use the available knowledge. A way to achieve this is providing a retrieval system that is not only able to retrieve the available and sought information, but also to filter out irrelevant documents, while giving the relevant ones the highest ranking. The main goal of this work has been to investigate how to improve the ability for a system to find and rank relevant documents. Our method is based on applying series of information retrieval techniques to search in biomedical information and combine them in an optimal manner. These techniques include extending and using well-established information retrieval (IR) similarity models like the TF-IDF and BM25 as the scoring schemes, and applying personalisation so that researchers may affect the ranking based on their view of relevance. The techniques have been implemented and tested in a proof-of-concept prototype called BioTracer, extending a Java-based open source search engine library. The preliminary results from our experiments using the TREC 2004 Genomic Track collection seem satisfactory, with the best mean average precision (MAP) of 0.5129 and the best precision at 100 retrieved documents (P@100) of 0.473. What can be concluded from these results is that involving the users in the search will often have positive effects on the ranking of search results, and that our BioTracer system represents a tool that may be able to meet the user’s information needs.

برای دانلود رایگان متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

Web Crawling Agents for Retrieving Biomedical Information

Autonomous agents for topic driven retrieval of information from the Web are currently a very active area of research. The ability to conduct real time searches for information is important for many users including biomedical scientists, health care professionals and the general public. We present preliminary research on different retrieval agents tested on their ability to retrieve biomedical ...

متن کامل

Comparison of Bibliographic Databases in Retrieving Information on Telemedicine

Background & Aims: Some of the main questions which can be of importance for those researchers who intend to perform a systematic review in a field of science are: ‘What databases should I use for my review?’; ‘Do all these databases have the same value?’; and ‘Which sourcesretrieved the highest of relevant references?’. The main aim of this work was the identification of the best database for ...

متن کامل

An automatic method for retrieving and indexing catalogues of biomedical courses.

Although there is wide information about Biomedical Informatics education and courses in different Websites, information is usually not exhaustive and difficult to update. We propose a new methodology based on information retrieval techniques for extracting, indexing and retrieving automatically information about educational offers. A web application has been developed to make available such in...

متن کامل

A Short Survey of Biomedical Relation Extraction Techniques

Biomedical information is growing rapidly in the recent years and retrieving useful data through information extraction system is getting more attention. In the current research, we focus on different aspects of relation extraction techniques in biomedical domain and briefly describe the state-of-the-art for relation extraction between a variety of biological elements.

متن کامل

Acquiring, Storing and Retrieving Diverse Biomedical Data Using the World-Wide-Web: The SenseLab Paradigm

The complexity of biomedical data creates unique challenges in their acquisition, storage and retrieval. Recent advances in the world-wide-web, database software and database-to-web middleware combined with the acceptance and use of the Internet in the scientific community create a strong framework to face these challenges. We describe SenseLab as a paradigm of a project that integrates the fac...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

عنوان ژورنال:

دوره   شماره 

صفحات  -

تاریخ انتشار 2010